AITopics

2603.29715

Country:

Europe > United Kingdom (0.04)
Europe > Belgium (0.04)
North America > United States > Utah (0.04)
(5 more...)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Sports (0.93)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)

Jianhao Peng, Olgica Milenkovic, Abhishek Agarwal

Online Convex Matrix Factorization with Representative Regions

Neural Information Processing Systems, Feb-15-2026, 07:40:27 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, dataset, matrix factorization, (12 more...)

Country:

North America > United States > Illinois > Champaign County > Urbana (0.05)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > Middle East > Jordan (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)

Neural Information Processing Systems, Feb-14-2026, 20:17:26 GMT

Dropping Symmetry for Fast Symmetric Nonnegative Matrix Factorization

Zhihui Zhu, Xiao Li, Kai Liu, Qiuwei Li

Because of the ubiquitous applications of NMF, many efficient algorithms have been proposed for solving (1).

algorithm, artificial intelligence, machine learning, (19 more...)

Country:

North America > United States > Colorado > Jefferson County > Golden (0.15)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Maryland > Baltimore (0.04)
North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.96)

Thanh, Olivier Vu, Gillis, Nicolas

Maximum-Volume Nonnegative Matrix Factorization

arXiv.org Machine Learning, Feb-6-2026

Nonnegative matrix factorization (NMF) is a popular data embedding technique. Given a nonnegative data matrix $X$, it aims at finding two lower-dimensional matrices, $W$ and $H$, such that $X\approx WH$, where the factors $W$ and $H$ are constrained to be element-wise nonnegative. The factor $W$ serves as a basis for the columns of $X$. In order to obtain more interpretable and unique solutions, minimum-volume NMF (MinVol NMF) minimizes the volume of $W$. In this paper, we consider the dual approach, where the volume of $H$ is maximized instead; this is referred to as maximum-volume NMF (MaxVol NMF). MaxVol NMF is identifiable under the same conditions as MinVol NMF in the noiseless case, but it behaves rather differently in the presence of noise. In practice, MaxVol NMF is much more effective at extracting a sparse decomposition and does not generate rank-deficient solutions. In fact, we prove that the solutions of MaxVol NMF with the largest volume correspond to clustering the columns of $X$ into disjoint clusters, while the solutions of MinVol NMF with the smallest volume are rank deficient. We propose two algorithms to solve MaxVol NMF. We also present a normalized variant of MaxVol NMF that exhibits better performance than MinVol NMF and MaxVol NMF, and can be interpreted as a continuum between standard NMF and orthogonal NMF. We illustrate our results in the context of hyperspectral unmixing.
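To make the basic model $X\approx WH$ concrete, here is a minimal sketch of plain NMF using the classical multiplicative updates for the Frobenius loss. It is not MaxVol or MinVol NMF and includes no volume term; the function name `nmf_multiplicative`, the synthetic data, and all parameter values are assumptions chosen for illustration.

```python
import numpy as np

def nmf_multiplicative(X, rank, n_iter=200, eps=1e-10, seed=0):
    """Approximate X ~= W @ H with W, H >= 0 via multiplicative updates."""
    rng = np.random.default_rng(seed)
    m, n = X.shape
    W = rng.random((m, rank)) + eps
    H = rng.random((rank, n)) + eps
    for _ in range(n_iter):
        # Each factor is multiplied by a nonnegative ratio,
        # so nonnegativity is preserved at every step.
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Synthetic nonnegative matrix of rank 3 (illustrative data only).
rng = np.random.default_rng(1)
X = rng.random((20, 3)) @ rng.random((3, 30))

W, H = nmf_multiplicative(X, rank=3)
rel_err = np.linalg.norm(X - W @ H) / np.linalg.norm(X)
print(f"relative error: {rel_err:.4f}")
```

The columns of `W` play the role of the basis described above, and each column of `H` holds the nonnegative weights that rebuild the corresponding column of `X`.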

artificial intelligence, machine learning, optimization problem, (13 more...)

2602.04795

Country:

North America > United States > Texas (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Belgium (0.04)

Genre: Research Report (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

Qi, Qianqian, van der Heijden, Peter G. M.

A review of NMF, PLSA, LBA, EMA, and LCA with a focus on the identifiability issue

arXiv.org Machine Learning, Dec-30-2025

Across fields such as machine learning, social science, and geography, considerable attention has been given to models that factorize a nonnegative matrix into the product of two or three matrices, subject to nonnegative or row-sum-to-1 constraints. Although these models are to a large extent similar or even equivalent, they are presented under different names, and their similarity is not well known. This paper highlights similarities among five popular models: latent budget analysis (LBA), latent class analysis (LCA), end-member analysis (EMA), probabilistic latent semantic analysis (PLSA), and nonnegative matrix factorization (NMF). We focus on an essential issue, identifiability, of these models and prove that the solution of LBA, EMA, LCA, PLSA is unique if and only if the solution of NMF is unique. We also provide a brief review of algorithms for these models. We illustrate the models with a time budget dataset from social science, and end the paper with a discussion of closely related models such as archetypal analysis.
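One way to see why nonnegative and row-sum-to-1 factorizations are so closely related is the standard rescaling argument, sketched below. It is an illustration under assumed synthetic data, not the paper's proof: if the rows of a nonnegative matrix `P` sum to 1 and `P = W0 @ H`, then rescaling so that the rows of `H` sum to 1 forces the rows of the other factor to sum to 1 as well.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical nonnegative factors and their product.
W = rng.random((6, 2))
H = rng.random((2, 4))
X = W @ H

# Row-normalise X so that every row sums to 1 (e.g. conditional proportions).
row_tot = X.sum(axis=1, keepdims=True)
P = X / row_tot
W0 = W / row_tot                 # P == W0 @ H still holds

# Move the row sums of H into the other factor: P = (W0 D)(D^-1 H).
d = H.sum(axis=1)
H1 = H / d[:, None]              # rows of H1 sum to 1 by construction
W1 = W0 * d[None, :]             # rows of W1 then sum to 1 automatically

print(np.allclose(W1 @ H1, P))
print(H1.sum(axis=1), W1.sum(axis=1))
```

The second property follows because `P @ 1 = W1 @ (H1 @ 1) = W1 @ 1`, and the left-hand side is the all-ones vector.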

artificial intelligence, machine learning, natural language, (18 more...)

2512.22282

Country:

North America > United States (0.68)
Asia (0.68)
Europe (0.46)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Nguyen, Manh, Pimentel-Alarcón, Daniel

Nonnegative Matrix Factorization through Cone Collapse

arXiv.org Artificial Intelligence, Dec-10-2025

Nonnegative matrix factorization (NMF) is a widely used tool for learning parts-based, low-dimensional representations of nonnegative data, with applications in vision, text, and bioinformatics. In clustering applications, orthogonal NMF (ONMF) variants further impose (approximate) orthogonality on the representation matrix so that its rows behave like soft cluster indicators. Existing algorithms, however, are typically derived from optimization viewpoints and do not explicitly exploit the conic geometry induced by NMF: data points lie in a convex cone whose extreme rays encode fundamental directions or "topics". In this work we revisit NMF from this geometric perspective and propose Cone Collapse, an algorithm that starts from the full nonnegative orthant and iteratively shrinks it toward the minimal cone generated by the data. We prove that, under mild assumptions on the data, Cone Collapse terminates in finitely many steps and recovers the minimal generating cone of $\mathbf{X}^\top$. Building on this basis, we then derive a cone-aware orthogonal NMF model (CC-NMF) by applying uni-orthogonal NMF to the recovered extreme rays. Across 16 benchmark gene-expression, text, and image datasets, CC-NMF consistently matches or outperforms strong NMF baselines (including multiplicative updates, ANLS, projective NMF, ONMF, and sparse NMF) in terms of clustering purity. These results demonstrate that explicitly recovering the data cone can yield both theoretically grounded and empirically strong NMF-based clustering methods.
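The conic picture is easiest to see in two dimensions, where the minimal cone containing nonnegative data is spanned by the two data points with the smallest and largest polar angle. The toy sketch below illustrates only that geometry; it is not the Cone Collapse algorithm, and the rays, sample sizes, and variable names are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)

# Two hypothetical extreme rays in the nonnegative quadrant (as columns).
rays = np.array([[1.0, 0.3],
                 [0.2, 1.0]])

# 50 interior points (strictly positive weights) plus one point on each ray.
coeffs = rng.random((2, 50)) + 0.1
points = np.hstack([rays @ coeffs, 2.0 * rays[:, [0]], 0.5 * rays[:, [1]]])

# In 2-D the extreme rays are the points with minimal and maximal angle.
angles = np.arctan2(points[1], points[0])
i_lo, i_hi = int(angles.argmin()), int(angles.argmax())
cone = np.column_stack([points[:, i_lo], points[:, i_hi]])

# Every data point is a nonnegative combination of the two extreme rays.
weights = np.linalg.solve(cone, points)
print(i_lo, i_hi, weights.min() >= -1e-9)
```

In higher dimensions the extreme rays can no longer be read off from a single angle, which is where a dedicated procedure such as the one described in the abstract becomes necessary.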

artificial intelligence, cone, machine learning, (15 more...)

2512.07879

Country: North America > United States > Wisconsin (0.14)

Genre: Research Report (0.84)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.35)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.34)

Greg Van Buskirk, Ben Raichel, Nicholas Ruozzi

Sparse Approximate Conic Hulls

Neural Information Processing Systems, Nov-21-2025, 10:42:20 GMT

Neural Information Processing Systems http://nips.cc/

algorithm, artificial intelligence, machine learning, (17 more...)

Country:

North America > United States > Texas > Dallas County > Richardson (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)
North America > United States > Arizona (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Neural Information Processing Systems, Nov-20-2025, 20:31:23 GMT

Dropping Symmetry for Fast Symmetric Nonnegative Matrix Factorization

Zhihui Zhu, Xiao Li, Kai Liu, Qiuwei Li

NMF is equivalent to the classical K-means kernel clustering in [11] and it is inherently suitable for clustering nonlinearly separable data from a similarity matrix [10].

algorithm, artificial intelligence, machine learning, (17 more...)

Country:

North America > United States > Colorado > Jefferson County > Golden (0.14)
Asia > China > Hong Kong (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

arXiv.org Machine Learning, Nov-11-2025

A Provably-Correct and Robust Convex Model for Smooth Separable NMF

Pan, Junjun, Leplat, Valentin, Ng, Michael, Gillis, Nicolas

Nonnegative matrix factorization (NMF) is a linear dimensionality reduction technique for nonnegative data, with applications such as hyperspectral unmixing and topic modeling. NMF is a difficult problem in general (NP-hard), and its solutions are typically not unique. To address these two issues, additional constraints or assumptions are often used. In particular, separability assumes that the basis vectors in the NMF are equal to some columns of the input matrix. In that case, the problem is referred to as separable NMF (SNMF) and can be solved in polynomial time with robustness guarantees, while identifying a unique solution. However, in real-world scenarios, due to noise or variability, multiple data points may lie near the basis vectors, which SNMF does not leverage. In this work, we rely on the smooth separability assumption, which assumes that each basis vector is close to multiple data points. We explore the properties of the corresponding problem, referred to as smooth SNMF (SSNMF), and examine how it relates to SNMF and orthogonal NMF. We then propose a convex model for SSNMF and show that it provably recovers the sought-after factors, even in the presence of noise. We finally adapt an existing fast gradient method to solve this convex model for SSNMF, and show that it compares favorably with state-of-the-art methods on both synthetic and hyperspectral datasets.
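The separability assumption can be illustrated with the successive projection algorithm (SPA), a classical greedy method for separable NMF. This is a swapped-in textbook technique, not the convex SSNMF model proposed in the paper; the helper name `spa` and the noiseless synthetic data (columns of `H` summing to 1) are assumptions.

```python
import numpy as np

def spa(X, r):
    """Successive projection: pick r columns of X as candidate basis vectors."""
    R = X.astype(float).copy()
    picked = []
    for _ in range(r):
        # The column with the largest residual norm is a vertex of the data.
        j = int(np.argmax(np.sum(R * R, axis=0)))
        picked.append(j)
        u = R[:, j] / np.linalg.norm(R[:, j])
        # Project all columns onto the orthogonal complement of the pick.
        R -= np.outer(u, u @ R)
    return picked

rng = np.random.default_rng(0)
W = rng.random((10, 3))                       # hypothetical basis vectors
mix = rng.dirichlet(np.ones(3), size=20).T    # 3 x 20 convex weights
H = np.hstack([np.eye(3), mix])               # first 3 columns are "pure"
X = W @ H                                     # separable by construction

idx = spa(X, 3)
print(sorted(idx))
```

Because every other column is a convex combination of the first three, the largest norm is always attained at one of the pure columns, so the selected indices are the columns of `X` that equal the basis vectors.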

artificial intelligence, data mining, machine learning, (14 more...)

2511.07109

Country:

Asia > China > Hong Kong (0.04)
Europe > Belgium (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.87)

Barbarino, Giovanni, Gillis, Nicolas, Saha, Subhayan

Robustness of Minimum-Volume Nonnegative Matrix Factorization under an Expanded Sufficiently Scattered Condition

arXiv.org Machine Learning, Nov-7-2025

In fact, low-rank approximations are a central tool in data analysis, being equivalent to linear dimensionality reduction techniques, with PCA and the truncated SVD as the workhorse approaches [60, 59, 45]. However, due to the sheer number of possible such decompositions, the information provided is hardly interpretable. This motivated researchers to introduce more constrained low-rank approximations. Among them, nonnegative matrix factorization (NMF) focuses on nonnegative input matrices X and requires the factors, W and H, to be nonnegative entry-wise. Nonnegativity is motivated by physical constraints, such as nonnegative sources and activations in hyperspectral imaging [9], chemometrics [15] and audio source separation [52], and by probabilistic modeling, such as topic modeling [39, 3] and unmixing of independent distributions [38]. Moreover, NMF leads to an easily interpretable and part-based representation of the data [39]. See also [13, 19, 25] and the references therein.
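The interpretability gap between an unconstrained low-rank approximation and NMF can be seen on a small example. The sketch below, on assumed synthetic data, computes a rank-2 truncated SVD of a strictly positive matrix: the approximation error matches the discarded singular values, but the second singular vector necessarily mixes positive and negative entries, since it is orthogonal to a leading vector whose entries all share one sign.

```python
import numpy as np

rng = np.random.default_rng(0)

# Strictly positive matrix of rank 3 (illustrative data only).
X = (rng.random((8, 3)) + 0.1) @ (rng.random((3, 12)) + 0.1)

U, s, Vt = np.linalg.svd(X, full_matrices=False)
r = 2
X_r = (U[:, :r] * s[:r]) @ Vt[:r]            # best rank-2 approximation

# Frobenius error equals the norm of the discarded singular values.
err = np.linalg.norm(X - X_r)
tail = np.sqrt(np.sum(s[r:] ** 2))

# The second left singular vector has entries of both signs.
mixed = bool(U[:, 1].min() < 0 < U[:, 1].max())
print(np.isclose(err, tail), mixed)
```

Such signed factors cannot be read as additive parts, which is the motivation for the nonnegativity constraints that NMF imposes on W and H.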

artificial intelligence, cone, machine learning, (17 more...)

2511.04291

Country: North America > United States (0.45)

Genre: Research Report (0.63)

Industry: Health & Medicine (0.45)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)